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Fingerprint Matching 



FIELD OF THE INVENTION 

The inventiQn relate to a method and apparatus for matching fingerprints. 

BACKGROUND OF THE INVENTION 

S Fing^iprinting technology is used to identify media content (siu;h as audio or 

video). An audio or video segment is identified by extracting a fingerprint fixxm it, and 
searching the extracted fingerprint in a database in which fingerprints of known contents are 
stored. Content is identified if the similarity between the extracted fingerprint and the stored 
fingeiprint is deemed sufficient 

10 The piime objective of multimedia fingerprinting is an efficient mechanism to 

establish the perceptual equality of two multimedia objects: not by comparing the (typically 
large) objects themselves, but by comparing the associated fingerprints (small by design). In 
most sr^stems using fingeiprinting technology, the fingerprints of a large number of 
multimedia objects along with its associated metadata (e.g. in the case of song information, 

IS name of artist, title and album) are stored in a database. The fingerprints serve as an index to 
the metadata. The metadata of unidentified multimedia content are then retrieved by 
computing a fingerprint and using this as a quecy in the fingerprint/metadata database. The 
advantage of usmg fingerprints instead of the multimedia content itself is tiu:ee-fi>ld: reduced 
memory/storage requirements as fingerprints are relatively small; efficient coir^arison as 

20 perceptual irrelevancies have already been removed fix>m fingerprints; and efficient searching 
as the data set to be searched is smaller. 

A fingerprint can be regarded as a short summary of an object Therefore, a 
fingerprint function should map an object X consisting of a large number of bits to a 
fingerprint F of only a limited number of bits. There are five main parameters of a fingerprint 

25 system: robustn^s; reliabiUty; fingerprint size; granularity; and search speed (or scalability). 

The degree of robustness of a system determines whether a particular object 
can be correctly identified fix>m a fingerprint in cases where signal degradation is present In 
order to achieve higih robustness the fingerprint F should be based on perceptual features 
which axe invariant (at least to a certain degree) with respect to signal degradations. 
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Pieferably, a severely degraded signal will still yield a similar iBngeiprmt to a fingerprint of 
an original undegraded signal. The 'false rejection rate" (FRR) is generally used to express 
the measure of robustness of the fingerprinting system. A false rejection occurs whenthe 
fingerprints of perceptually similar objects are too different to lead to a positive 
S identification. 

The reliability of a fingerprinting system refers to how often an object is 
identified falsely. In other words, reliability relate to a "false acceptance rate" (FAR) - i.e. 
the probability that two different objects may be falsely declared to be the same. 

Obviously, fingerprint size is important to any fingerprinting system. Jn 
1 0 general, the smaller the fing^print size, the more fingerprints can be stored in a database. 
Fingerprint size is often expressed in bits per second and determines to a large degree the 
memory resources that are needed for a fingerprint database server. 

Granularity is a parameter that can depend on the q^lication and relates to 
how long (large) a particular samgple of an object is required in order to identify it 
15 Searcli speed (or scalability), as it sounds, refers to the time needed in order 

to find a fingerprint in a fingerprint database. 

The above five basic parameters have a large impact on each other. For 
instance, to achieve a lower granularity, one needs to extract a larger fingerprint to obtain the 
same reliability. This is due to the fact that the false acceptance rate is inversely related to 
20 fingerprint size. Another exanq>le: search speed will generally increase when one designs a 
more robust fingerprint 

Having discussed the basic parameters of a fingerprinting system, a general 
description of a typical fingexprinting system is now made. 

A fibog^iprint may be based on extracting a feature-vector from an originating 
25 audio or video signal. Such vectors are stored in a database with reference to the relevant 
metadata (e.g. titie, auflior, ete.). Upon reception of an unknown signal, a feature-vector is 
extracted ftom the unknown signal, which is subsequenfly used as a query on the fingerprint 
database. If the distance between the query featur^vector and its best match in the database 
is below a given threshold, then the two items are declared equal and the associated metadata 
30 are returned: i.e. the received content has been identified. 

Hie threshold that is used in the matching process is a trade-off between the 
felse acceptance rate (FAR) and the false rejection rate (FRR). For instance, increasing the 
threshold (Le. increasing the acceptable "distance" between two fingerprints for those 
fingerprints to still be judged similar) increases the FAR, but at the same time it reduces the 
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FRR. The trade-off between FAR and FRR is usually done via the so-called Neyman-Peaisan 
approach. This means that the threshold is selected to have the smallest value which keeps 
the FAR below a pie-specified, allowable level. The FRR is not used for detennining the 
threshold, but merely results from the selected tbi^hold value. 

US 2002/0178410 Al (Haitsma, Kalker, Baggen and Oostveen) discloses a 
method and ai^aiatus for generating and matching fingerprints of multimedia content In this 
document it is described on page 4 thereof how two 3 second audio clips are declared similar 
if Ihe Hamming distance between two derived fingerprint blocks Hi and H2 is less lhan a 
certain threshold value T. 

In order to anafyse the choice of the threshold T, the authors of US 
2002/0178410 assume that Ihe fingerprint extraction process yields random ii.d. 
(independent and identically distributed) bits. The number of bit errors will then have a 
binomial distiibution with parameters (n, p) where n equals the number of bits extracted and 
p (=0.5) is flie probability that a 0 or 1 bit is extracted. Since n is large, the binomial 
distiibution can be approximated by a normal distribution with a mean fi=np and a standard 
deviation o = ylnp(l~p). Given a fingerprint block Hi, the probability that a randomly 
selected fingerprint block H2 has less than T=a.n errors with respect to Hi is then given by: 

However, in practice robust fingeiprints have hig^i correlation along the time * 
axis. This may be due to the large time correlation of the underlying video sequence, or the 
overly of audio frames. Experim^ts for audio fingerprinte show that the number of 
erroneous bits is normally distributed, but that the standard deviation is a|>proximately 3 
times larger than the i.id case. Equation (1) therefore is modified to include this &ctor 3. 

The above approach assumes that the distribution between the fingerprints is 
stationary. Although this seems to be a reasonable assumptbn for certain technologies, this is 
definitely not the case for video fingerprinting. In video fingerprinting, the amount of 
"activity" in the video is directlb^ reflected in the correlation of liie fingerprint bits: prolonged 
stills lead to constant (i.e., very highly correlated) fingeiprints, whereas a **flashy*' music clip 
will lead to a very low correlation between the fingerprint bits. This non-stationarity leads to 
problems in determining an appropriate value for the fltreshold. 
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OBJECT AND SUMMARY OF THE INVENTION 

It is an aim of embodiments of the present invention to propose an 
aciangement for providing an adaptive thresholding technique. 

5 According to a first aspect of the invention, there is provided a method of 

comparing a query fing^rint to a candidate fingerprint, the method being characterised by 
conq)rising: determining a statistical model of the query fingerprint and/or a candidate 
fibagerprint; and on the basis of Ihe statistical model, deriving a threshold distance within 
which the query fingerprint and the candidate fingerprint may be declared similar. 

10 A second aspect of the invention provides a melhod of matehing a query 

object to a known object, wherem a plurality of candidate fingerprints representing a plurality 
of candidate objects are pre-stored in a database, the method comprising receiving an 
information signal forming part of the query object and constructing a query fingerprint 
therefrom and coirparing the query fingerprint to a candidate fingerprint in the database, tiie 

15 method being characterised in that it further comprises the steps of: determining a statistical 
model for the query fingerprint and/or the candidate fingerprint; and on the basis of the 
statistical model, deriving a threshold distance within which the query fingerprint and the 
candidate fingerprint may be declared similar. 

In the methods of the first and second aspects, the derivation of a threshold 

20 based upon a statistical model of the particular fingerprint provides ad^tive threshold setting 
which may optimise the F A.R. according to query fingerprint type/ internal characteristics 
giving improved matching qualities over the supplication of an arbitrary thresholding system. 

Preferably, if a candidate fingerprint is found to be separated fix>m the query 
fingerprint by a distance less than the threshold distance, and Ihe distance between the 

25 candidate and flie query fingerprint is less than the distance between any other candidate 
fingerprint and the query fingerprint, then the candidate fingerprmt is declared the best 
matching candidate fingerprint and the candidate object represented by the best matching 
candidate fingerprint and the query object rqpmsented by the query fingerprint are deemed to 
be the same. 

30 Preferably, the statistical model comprises the result of performing an internal 

correlation on the query fingerprint and/or Ihe candidate fingerprint. 

Preferably, the fingerprints conq>rise binary values and the statistical model is 
conqiuted for flie query fingerprint by determining a transition probability q for. the query 
fingerprint by determining how many bits of a query fingerprint frame F(m,k:) are different 
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from their corresponding bit in their preceding fingerprint frame F(m4c-1) and dividing the 
number of transitions by a maximum value M*(k-1), which would be obtained if all 
fingerprint bits were of an opposite state to their corresponding preceding bit, where each 
fingerprint comprises M bits per fiame and spans K fiames, in which k is the frame index 
(ranging from 0 to K) and m is the bit-index within a frame (ranging from 0 to M). 

The threshold distance T may then be cornputed fixmi the following equation 
based on a desired False Acceptance Rate (FAR): 



FAR = — erfc 
2 



1 - 2T / l + (1 - 2q)^ 
a/2^ )/i - (1 - 2q)^ 



J 



(4) 



In a third aspect, the invention provides apparatus fer matching a query object 
to a known object, the ^aratus comprising a fingerprint extaction module for receiving an 
information signal forming part of a query object and constracting a query fingerprint 
therefixmi and a fingerprint matching module for comparing the query fingerprint to 
candidate fingerprints stored in a database to one or more candidate fingerprints, the 
apparatus being characterised in that it fiarther comprises: a statistical module for determining 
a statistical model of the query fingerprint and/or one or more of the one or more candidate 
fingerprints; a threshold determiner ,deriving on the basis of the statistical model, a threshold 
distance T within which the query fingerprint and a candidate fingerprint may be declared 
similar; and an identification module arranged such that if a candidate fingerprint is foimd to 
be separated from the query fingerprint by a distance less than the threshold distance T, and 
the distance between the candidate and the query fingerprint is less than the distance between 
any other candidate fingerprint and the query fingerprint, then the candidate fingerprint is 
declared the best matching candidate fingerprint and the candidate object represented by the 
best matdiing candidate fingerprint and the query object represented by the query fingerprint 
are deemed to be the same. 

BRIEF DESCRIPITON OF THE DRAWINGS 

For a better understanding of the invention, and to show how embodiments of 
the same may be carried into effect, reference will now be made, by way of example, to the 
accompanying diagrammatic drawings in which: 

Figure 1 shows a ftmctional block diagram illustrating a fingerprinting method 
with an adaptive threshold in accordance with an embodiment of the inv^tion; 
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Figure 2 is a flow diagram explaising in general flie process involved in 

finding and matclmig fingerprinls in accordance with an embodiment of the invention; 

Figure 3 is a flow diagram illustrating in general the methodology fiw 

determining an adaptive threshold in accordance with an embodiment of the present 

invention; and 

Figure 4 is a flow diagram illustrating a specific adaptive thrediold setting 
methodology in accordance with embodiments of the invention. 

DESCRIPTION OF EMBODIMENTS 

Refendng to Figure 1, there is shown a functional block diagram divided into a 
client side 100 and a database server side 200. At the client side, an object is received by a 
fingerprint ejrtraction module 1 10 and a query fingerprint F computed for tiie object The 
query fingerprint F is, on tiie one hand, passed to an statistical module 1 20 and, on Ibe other 
hand, also passed to tiie database server side 200. The statistical module 120 determines a 
measure of randomness/correlation (for instance, it may determine fbe internal correlation) of 
the query fingerprint F and passes this information to a threshold determiner 130. The 
threshold determiner 130, on tiie basis of the information ftom tiie module 120 ad^tively 
sets a tiireshold level T and passes this threshold level T to the database server side 200. 

At the database server side 200, a matching module 210 receives the query 
fingerprint F ftom the cKent side 100 and looks for tiie best match of that fingerprint within a 
database of known fingerprints. The best matdi information is then passed to a flireshold 
compaiison module 220 to determine wheflier a best matching candidate fingerprint is close 
enough (within threshold distance T) to flie query fingerprint to determine flie identity of tiie 
icput object with tiie matched object corr^onding to the candidate fin^rprint In the case 
where tiie fingerprint F takes binary values, tiie threshold comparison module 220 might, for 
instance, compare tiie Hamming distance between a fingerprint block Hi and a fingerprint 
block H2 relating to flie best mateh in the database 210 and check to see whetiier tiie . . 
TTatnming distance between tiie two blocks is below tiie tiireshold distance T, supplied to flie 
comparison module 220 from the flneshold determinmg module 130. An identification 
decision is made by identification module 230 so that if flie Hamming distance between flie 
two derived fingerprint blocks is below the threshold distance T then flie unidentified query 
object is declared similar to tiie object found in tiie database and tiie relevant metadata is 
returned. 
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In the above desctiptioii the query fingerprint F and the threshold T are sent by 
the client side 100 to the database server side 200. Here, of course, it could be noted that the 
threshold T could also be determined at the database server side 200 and that, ih^fore, 
modifications of the aforementioned block diagram are of course possible. 

SefetringnowtoFigure2, there is shown a flow diagram wMche^lains, in 
general, the operation of the conq>onents of ifae block diagram of Figure 1 in finding and 
matching fingerprints. 

hi a step SlOO, an object sarople ( e-g. in the case of video a short "clq>'0 is 
received and a query fingerprint determined based upon the sample. This query fingerprint 
may be determined in accordance with any suitable prior art method (such as disclosed in 
US 2002/0178410 Al). In a step S200 (reached by pathway "A"), a threshold for the query 
fingerprint is determined in accordance with the particular characteristics 
(randomness/correlation) of the query fingerprint 

In a step S300, which may be carried out in parallel with step S200, the query 
fingerprint is matched to fingerprints held on the database server side 200, to return a best 
matching candidate. Again, this matching process may be performed conventionally, so as to , 
return the closest match to the query fingerprint. 

In the step S300, the '"distance" between the query fingerprint and the best 
match candidate will be determined and, in a step S400, it is checked whether or not the 
''distance" is less than the threshold distance determined in step S200. If the distance between 
the query fingerprint and the best match candidate is found in step S400 to be greater than the 
threshold, then in step SSOO the result is retumed that no matching object to the query object 
has been found On the other hand, if the distance between query fingerprint and best match 
candidate fingerprint is less than the threshold distance in step S400, then in step S600 a 
match is declared between the query object and the object in the database relating to the best 
matching candidate. Metadata ete., of the best matohing object may then be retumed to a 
user. 

In Figure 2, the pathway "A" denoted by the broken lin^ leading to step S200 
fix)m SlOO denote one option for setting a threshold T=T1 based on the query fingerprint 
Alternatively however, pathway "A" may be disregarded and a threshold T=T2 may be based 
upon the characteristics of the best matohing candidate. This possibility is denoted by the 
alternative pathway B from S300 to S200. 
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In a fiirflier alternative, Ihe threshold T may be set based vpan a cQmbjnation 



of the characteristics of both the query fingerprint and the best matching candidate fingerprint 



Figure 3 is a flow diagram illustrating the general methodology for ad^tively 

5 determining a given threshold T. 

In step S210, the query candidate fingerprint is received and a measure of 
randomness of the fingerprint determined, then in step S220 a tbxe^old distance is set 
according to the measure of randomness found in step S210. 

As will be s^jpreciated firom Hie above and from the explanation in relation to 

10 Figure 1, the threshold value T (Tl or T2) used in the comparison is ad^ted to the 

randomness/correlation in either Ihe query-fingCTprint oi^and the best matching candidate. 
More specifically, m the case of tiireshold determination for a query fingerprint, flie 
correlation of the query fingerprint is determined and, fixmi this correlation, the tiueshold to 
be used during matching is cornputed. The less random the internal cocrelation is found to be, 

15 the smaller the threshold distance T can be set without adversely affecting the FRR. 

As stated, the threshold is determined upon the internal correlation of die 
query fingerprint, a best matching candidate fingerprint or a combination of the two. In cases 
where the fingerprint is binary and flie fingerprint-bits behave like a Markov-process, a 
solution can be derived for adaptively setting the threshold. 

20 The solution to the ad^tive threshold setting problem is shown in Figure 4. In 

a step S221, the internal correlation of the fingerprint in question is determined, in step S222 
tiie transition probabilily far the fingerprint is determined based upon tiie internal correlation 
and in step S223, the threshold distance is set ad^tively, based upon both the transition 
probalnlity (explained below) and a desired &lse acceptance rate. 

25 the fingerprint consist ofM bite per fiaine and span K flames. In tiiis case, 

flie fingerprint can be denoted F(nijk), where k is the fisame index (ranging fixjm 0 to K-1) 
and m is the bit-index within a fifame (ranging fixwtn 0 to M-1). Let q denote the probabilily 
that a fingerprint-bit extracted firom frame k is unequal to the corresponding fingerprint bit 
from frame k-1 by (a=Prob[bit(m,k) + bU(m,k-l) ]). This probability q is called flie 
30 transition probability. In this case the correlation increases (coirqjared to the case of purely 
random bits, in which ^1/2) by a facUa 



e.g. by setting a threshold at the average between two derived ad^tive thr^olds Tl, T2. 




(3) 
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As a consequence, the False Acceptance Rate FAR is described by the relation 



FAR = i-erfc 
2 



1-2T jl+(l- 2q)~ ^ 



V2i- Vl-(l-2q)2 ("^^ 

Use of the above relation for computing an ad^ve threshold from the desired 
FAR and the conr^mted transition inrobability q may be summarised as follows: 
Extract fingerpriBt F 

Determine the transition probability q for fingerprint F, as follows: 
(a) Determine how many of the fingerprint bits F(m,k) axe diEfeient fi»m then: 

predecessor F(m4c-1). 

0*) Divide the number of transitions, as confuted in step (a) by the theoretical 

mawmum M*(K-1), which would be obtained if fbr each frame, all fingerprint bits would be 
the apposite fixjm the bits m the previous fiame to determine the transition probability q = 
(number of bit-transitions)/(M*(K-l)). 

Determine flie tiireshold T which is to be used fox matching this specific query . 
fingeipriHt F fix>m the computed vahie q, and a defined pre-agreed False Acceptance Rate 
using relation (4). 

From the above, the tiu-eshold T may be ad^tively set for T=T1 (based on 
correlation of query fingerprint above), or T=T2 (based on correlation of best match 

fingerprint above), or T=T3 (based on a combination of Tl, T2 [e.g.T = xhen, in ' 

2 

the decision stage if the Hamming distance is less than T, declare the underiying objects to be 
the same. 

In the above specific examples of the present invention the tiueshold distance 
is set adaptively based on the internal characteristics of a particular query sample or, indeed, 
of a particular candidate sanq)le or set of samples. However, whilst the specific examples 
described take the internal characteristics m question to be randomness/coxTelation, it will be 
realised tiiat oiher types of statistical distribution might ^ly to certain types of information 
signal and tiiat, therefore, the invention may be legitimately extended to providing adqrtive 
thresholds according to any given ^Ucable "statistical model" to which a query sample or a 
candidate sample fingeiprint is expected to confonn. 

Further, the skilled man win realise that whilst tiie Figure 2 tiunu^ 4 flow 
diagrams show one anangement for inq>lementing the invention, oflier arrangements are 
possible. For instance, ratiier than returning a single best match candidate in step S300 of 
figure 2, a plurality of close matching candidates within a threshold distance may be returned 
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and processed in paraUel (or less advantageously in series) to thereafter calculate the "best" 
match. The invention can also be ^lied using so-called "pnming" techniques in which 
certain candidates within the database can be immediately discarded if it is obvious that they 
can never make a match - searching/matching can tben be done wiflun a much reduced 
search space. 

In accordance with embodiments of Hie invention, methods and apparatus for 
setting an ad^ve tiireshold are disclosed, in which Ihe threshold depends upon specific 
characteristics of a fingerprint The particular method is very suitable for use in matching of 
video content, but is not limited to this. The techniques described may be ^Ued to various 
different areas of technology and various different signal types, including, but not limited to, 
audio signals, video signals, multimedia signals. 

The skilled man will realise that the processes described may be implemented 
in software, hardware, or any suitable combination. 

In summary, tiie invention relates to methods and ^>paratus for fingerprint 
matching. In an embodiment of the invention j^jparatus conqjrismg a fingerprint ejctraction 
module (1 10), a fingerprint matehing module (210), a statistical module (120) and an 
identification module is provided. The fingerprint extraction module (1 10) receives an 
information signal forming part of a query object and constracts a query fingerprint The 
fingerprint matehing module (210) compares tiie query fingerprint to candidates stored in a 
database (215) to find at least one potentially best matehing candidate. Meanwhile, flie 
statistical module determines a statistical model of tiie query fingerprint so as to, for instance, 
determine the statistical distribution of the query fingerprint The tiireshold detenmner (120) 
is arranged, on tiie basis of tiie distribution of flie query fingerprint to derive an adaptive 
tiireshold distance T witiiin which tiie query fingerprint and a potentially best matchmg 
candidate may be declared shnilar by tiie identification module (130). By setting a tiireshold 
in an adaptive manner accordmg to tiie statistical distiibution of tiie query fingerprint, an 
improved felse acceptance rate F A JL and oflier advantages may be achieved. 
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CLAIMS: 



1 • A method of comparing a query fingerprint to a candidate fingerprint, the 

method being characterised by comprising: determining a statistical model of the query 
fingerprint and/or a candidate fingerprint and, on the basis of the statistical model, deriving a 
threshold distance within which the query fingerprint and the candidate fingerprint may be 
declared similar. 

2. A method of matching a query object to a known object, wheiem a plurality of 

candidate fingerprints representing apluraHty of candidate objects are pre-stored in a 
database, the method comprising receiving an information signal forming part of the query 
object and constmcting a query fingerprint therefirom and comparing the query fingerprint to 
a candidate fingerprint in the database, the method being characterised by the further steps of: 
determirung a statistical mo^l for the query fingerprint and/or the candidate ; 
fingerprini; and 

on the basis of the statistical model, deriving a threshold distance within which 
the query fingerprint and the candidate fingerprint may be declared similar. 

3. The method of claim 1 or 2, wherein if a candidate fingerprint is found to be 
separated fi-om the query fingerprint by a distance less than the threshold distance, and fhe 
distance between the candidate and the query fingerprint is less ftan the distance between any 
other candidate fingeiprint and fbs query fingerprint, flien ibe candidate fingerprint is 
declared Ae best matching candidate fingerprint and fhe candidate olgect represented by the 
best matching candidate fingraprint and flie query object represented by the query fingerprint 
are deemed to be the same. 

The method of claim 1, 2 or 3, wherein the statistical model conqnises the 
result of performing an internal correlation on the query fingerprint and/or the candidate 
fingeiprint 
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5. The mefhod of claim 4, wherein the fingerponts comprise a plwality of fiames 
containing binary values and flie statistical model is computed for the query fingerprint by 
determining a transition probabihty q for the query fingerprint by determining how many bits 
of a frame of the query fingerprint F(na) are different from their corresponding bit in Iheir 
preceding fingerprint frame FCmJc-l) and dividing the number of transitions by a maximum 
value M*(k-1), which would be obtained if aU fingerprint bits were of an opposite state to 
their corresponding preceding bit, where each fingerprint comprises M bits per fiame and 
spans K fiames, in which k is the fiame index (ranging fi»m 0 to K) and m is the bit-index 
within a fiame (ranging from 0 to M). 

6. The method of claim 5, wherein flie flireshold distance T is conq>uted fiwn the 
following equation based on a desired False Acceptance Rate (FAR): 

1 - 2T / l + (1 - 2q)^ ^ 
V2^ \1- (1- 2q)2 



FAR = -^«cfc 
2 



7 Apparatus for matching a query object to a Imownobjecti the ^jparatus 

comprising a fingerprint extraction module (1 10) for receiving an information signal formmg 
part of a query object and constructmg a query fingerprint therefrom and a fingeiprint 
matching module (210) for comparing the query fingerprint to candidate fingerprints stored 
in a database (215) to one or more candidate fingeiprints, the ^aiatus being characterised 

by also coir^rising: 

a statistical module (120) for determining a statistical model of the query 
fingerprint and/or one or more of the one or more candidate fingerprints; 

a threshold determiner (120) deriving, on the basis of the statistical model, a 
threshold distance T within which the query fingerprint and a potentiaUy best matching 
candidate fingerprint may be declared similar; and 

an identification module (230) arranged such that if a candidate fingeiprint is 
found to be separated from the query fingeiprint by a distance less ibm the threshold distance 
T, and the distance between the candidate and the query fingerprint is less than the distance 
between any other candidate fimgerprint and the query fingeiprint, then the candidate 
fingeiprint is declared the best matching candidate fingeiprint and flie candidate object 
represented by the best matching candidate fingerprint and the query object represented by 
the query fingerprint are deemed to be the same. 
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8. 



The apparatus of claim 7, wherein fhe statistical module (120) perfoaxis an 



internal correlation on the query fingerprint and/or the one or more candidate fingerprints. 



containing binary values and the statistical module (120) computes the statistical model for 
the query fingerprint oi/and the candidate fingerprint by determining a transition probability 
q by determining how noany bits of a fi'ame of the quety fingerprint F(m,k) are different from 
flieir corresponding bit in the preceding fingerprint frame F(m,k-1) and dividing the number 
10 of transitions by a maximum value M*(k-1), which would be obtained if all fingerprint bits 
were of an opposite state to their corresponding preceding bit, where each fingerprint 
con^rises M bits per frame and spans K firames, in which k is the frame index (ranging firom 
0 to K) and m is the bit-index within a firame (ranging firom 0 to M). 

IS 10. The method of claim 9, wherein the threshold determiner (1 30) comyputes the 

threshold distance T fixmi the following equation based on a desired False Acceptance Rate 



5 



9. 



The method of claim 8, wherein the fingerprints comprise a plurality of fiames 



(FAR): 
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ABSTEIACT: 



The invention relates to methods and apparatus for fingetprint matching. In an 
embodiment of the invention ^paratus comprising a fingerprint extraction module (110), a 
fingerprint matching module (210), a statistical module (120) and an identification module is 
provided The fingerprint extraction module (1 10) receives an information signal forming 
5 part of a query object and constmcts a query fingerprint. The fingerprint matching module 
(210) compares the quety fingerprint to candidates stored in a database (21 5) to find at least 
one potentially best matching candidate. Meanwhile, the statistical module detecmines a 
statistical model of the query fingerprint so as to, for instance, determine the statistical 
distribution of the query fingerprint The threshold determiner (120) is arranged, on the basis 
10 of tiie distribution of tiie query fingerprint to derive an adaptive threshold distance T within 
which the query fingerprint and a potentially best matching candidate may be declared 
similar by the identification module (130). By setting a thresliold in an adaptive manner 
according to the statistical distribution of the query fiaigerprint, an improved &lse acceptance 
rate F A.R. and other advantages may be achieved. 

15 



[Figure 1]. 
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